Textual Inference for Retrieving Labeled Object Descriptions

نویسندگان

  • Alicia Tribble
  • Eric Nyberg
  • Carolyn Penstein Rosé
  • Bruce W. Porter
چکیده

This thesis presents a knowledge-based solution for retrieving English descriptions for objects in a collection. Based on detailed analysis of the errors made by a baseline system relying on surface-level features (i.e. term frequency), we infer that an ideal solution to this problem should use deeper representations of the meaning encoded in textual descriptions. Applied Textual Inference (ATI) as used in this thesis refers to the class of generic task-based evaluations that address this need. ATI tasks are challenge problems. Because they are intended to drive research on text understanding, the problems are designed to be hard enough to require reasoning. However in order to support cross-site comparisons of results, the problems are evaluated at the surface level. Examples include recognizing textual entailment (RTE), paraphrasing, summarization, word-replacement, and some types of question answering (QA). Deep representations and knowledge-based techniques have come to play an important role in state-of-the-art solutions for applied textual inference. However, to adapt them effectively for a particular ATI task, we must first understand that task and the details of why more shallow techniques fail. This thesis frames the problem of image description retrieval as an instance of ATI, and demonstrates how an inference engine and a set of symbolic knowledge resources in the form of ontologies can improve performance on this task, as measured by Mean Reciprocal Rank. In the process, we describe the results of several sub-tasks, each of which represents a contribution to the understanding of this problem and to the discovery and implementation of a knowledge-based solution: Introduce an image retrieval task supported by a data set containing over 50,000 images, hand-labeled with multiple descriptions; present a series of parameterizations for calculating the similarity between two descriptions; identify classes of

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Retrieving Top-k Prestige-Based Relevant Spatial Web Objects

The location-aware keyword query returns ranked objects that are near a query location and that have textual descriptions that match query keywords. This query occurs inherently in many types of mobile and traditional web services and applications, e.g., Yellow Pages and Maps services. Previous work considers the potential results of such a query as being independent when ranking them. However,...

متن کامل

SPATIAL INFERENCE AND CONSTRAINT SOLVING How to Depict Textual Spatial Descriptions from Internet

Today there are still many applications in the Internet, where the user is given a textual description of a spatial configuration (e.g. chat, e-mail or newsgroups). The user is asked to imagine the scene and to draw inferences. We present a new approach to generate depictions of such scenes. Besides of drawing spatial inferences, this leads to the problem of solving a system of complicated nume...

متن کامل

Link-based Classification using Labeled and Unlabeled Data

There has been a surge of interest in learning using a mix of labeled and unlabeled data. General approaches include semi-supervised learning and tranductive inference. In this paper we look at some of the unique ways in which unlabeled data can improve performance when doing link-based classification, the classification of objects making use of both object descriptions and the links between ob...

متن کامل

A Typed Representation and Type Inference for MPEG - 7 Media Descriptions ∗

MPEG-7 is a promising standard for multimedia content description. Adequate means for the management of large amounts of MPEG-7 media descriptions are needed in the near future. Essentially, MPEG-7 media descriptions are XML documents following media description schemes defined with an extension of XML Schema named MPEG-7 DDL. However, XML database solutions available today are not well-suited ...

متن کامل

MIS @ Retrieving Diverse Social Images Task 2015

In this paper, we describe our approach for the MediaEval 2015 Retrieving Diverse Social Images Task. The proposed approach exploits available user-generated textual descriptions and the visual content of the images in a combination with common, unsupervised clustering techniques in order to increase the diversification of retrieval results. Preliminary experiments indicate that the approach ge...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010